🎮 Reinforcement Learning - buckman · Scour

🎯RLHF DEV Community·

RLHF vs DPO vs IPO vs KTO: which alignment method should you use

Discussed on DEV

🤖Large Language Models spectrum.ieee.org

·

IEEE Rolls Out Large Language Models Virtual Training Course

Covers 4 stories including How to Compress DICOM (.dcm) Images from 1.4 MB to KB Using Python?

🐙Cosmic Horror lesswrong.com·

The LLM shoggoth meme is weirder than you think

Covers 3 stories including An open letter to all future readers who think they've found consciousness in their AI

🎯RLHF Interconnects·

Frontier post-training recipe review with Finbarr Timbers

Covers 10 stories including DeepSeek-V3 Technical Report

🔒Security That Privacy Guy! Blog·

I told them forced consent was unlawful. Five years later it cost Elkjop €1.8 million

Discussed on Hacker News

🎯RLHF DEV Community·

The Three Phases of Post-Training: How LLMs Learn to Provide Sensible Responses

Discussed on DEV

🤖AI venturebeat.com·

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

Covers 6 stories including Anthropic/Claude AI is down

Covered by 3 sources including tldr.tech, AI Changes Everything

Discussed on Hacker News

📈Algorithmic Trading DEV Community·

Building a Self-Optimizing Python Trading Bot with Reinforcement Learning and Binance API

Discussed on DEV

🤖AI lesswrong.com·

What are some angles of attack for making continual learning safer?

Covers 2 stories including Claude's Constitution

🧠Claude DEV Community·

You don't pick the RL algorithm — SIA's Feedback loop does

Covers SIA: Self Improving AI with Harness & Weight Updates

Discussed on DEV

🔬Anthropic lesswrong.com·

How I think developers of frontier AI systems and regulators ought to act in the face of existential AI risk

Covers 2 stories including [2212.08073] Constitutional AI: Harmlessness from AI Feedback

📊Compute Markets DEV Community·

Build a GDPR-Compliant AI Pipeline with Intel TDX — Step by Step: 3 Hours vs 6 Months

Discussed on DEV

🧠Context Engineering lesswrong.com·

Synthetic document finetuning for instilling positive traits

Covers 2 stories including Agentic Misalignment: How LLMs could be insider threats

📉Statistics lesswrong.com·

A preliminary experiment regarding consistency as a measure of conceptual abilities in language models

🗄️Databases DEV Community·

How to Design an Effective Referral Reward System: A Complete Technical Guide for SaaS

Discussed on DEV

📊ML Research DEV Community·

FutureX · Physical AI Daily — Issue 29 (06/16)

Discussed on DEV

⚖️AI Regulation lesswrong.com·

Tactical and Operational Exploratory Modeling for AI Governance

Covers AI 2027

🧠LLM Training DEV Community·

I was fine-tuning a language model on Arabic. The loss was perfect. It spoke Chinese.

Discussed on DEV

🤖AI lesswrong.com·

How reality turns to slop

Covers Paper

🔬Anthropic DEV Community·

Can Constitutional AI Make AI Safe? Here's Why I'm More Optimistic

Covers 2 stories including https://www.anthropic.com/research/constitutional-ai-harmlessness-from-ai-feedback

Discussed on DEV

No more posts from buckman's subscribed feeds.

Scour all 25,324 feeds Learn more about Feeds

Log in to enable infinite scrolling